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Abstract 

This note explores, via information theory, 
the admissibility of certain nonlocal hidden-variable 
theories. Consider a pair of Stem-Gerlach devices 
with fixed nonparallel orientations that periodically 
perform spin measurements on identically prepared 
pairs of electrons in the singlet spin state. Suppose 
the outcomes are recorded as binary strings / and r 
(with l n and r„ denoting their n-length prefixes). The 
hidden-variable theories considered here require that 
there exists a recursive function which may be used to 
transform l n into r n for any n. This note demon- 
strates that such a theory cannot reproduce all the 
statistical predictions of quantum mechanics. Specifi- 
cally, consider an ensemble of outcome pairs (fr). 
From the associated^ probability measure, the Shannon 
entropies H n and H n for strings l n and pairs (l n ^i' n ) 
may be formed. It is shown that such a theory re- 
quires that \H n -H n \ be bounded - contrasting the 
quantum mechanical prediction that it grow with n. 

I. Introduction 

The class of inequalities initiated by Bell 1 do 

not absolutely exclude the possibility of hidden vari- 
ables underlying the phenomena statistically described 
by quantum mechanics. Hidden-variable theories of 
the so-called nonlocal variety are not constrained by 
Bell’s theorem. Although there is no pressing theo- 
retical reason for taking the existence of such a theory 
seriously, it is clear that one can only truly begin to 
understand quantum mechanics when one first under- 
stands what it is not. This note will attempt to make 
a contribution to this end. Here a seemingly not- a 
prion unreasonable class of nonlocal hidden-variable 
theories called the “computable hidden-variable theo- 
ries” (CHV’s) will first be defined for a particular 
thought experiment and then shown to be inconsistent 
with certain statistical requirements of quantum me- 
chanics. The reason for this procedure is to make ex- 
plicit, through the language of algorithmic informa- 
tion theory , 2 ’ 3 an aspect of quantum theory hitherto 
seldom discussed and then demonstrate the practical 
use of this aspect in answering foundational questions. 
This aspect is that the data obtained from identical 
measurements performed on identically prepared sys- 
tems is generally “algorithmically incompressible.” 


H. The Thought Experiment and the Result 

The thought experiment described is a modifi- 
cation of the standard one used for discussions of 
Bell’s theorem. Consider a pair of distantly separat- 
ed Stem-Gerlach (SG) devices situated so to (flaw- 
lessly) measure the spins of a pair of correlated elec- 
trons. These are called the left and right devices, re- 
spectively. For definiteness, suppose that the corre- 
lated electrons are in the singlet spin state 

| ip) = ^=( 1 1 )l 1 1 )r _ U)lIT)r) # 

Assume that the left and right SG devices, respec- 
tively, are oriented so that they invariably measure 
spin along z and an axis that differs from z by a com- 
putable angle 9. u 9 computable” simply means that 
there is an algorithm for generating the decimal ex- 
pansion of 9. E.g., 9 = ir/6 is clearly computable. 

Suppose that at neriodic time intervals these devices 
are supplied with identically prepared pairs of corre- 
lated electrons. (This would allow the measurement 
outcomes to serve as a “window” into the hidden 
dynamics of the devices, if such a dynamics did indeed 
exist.) Finally, imagine that each SG device is en- 
dowed with the capability of recording its measure- 
ment outcomes as a string of binary digits — 0 and 1 
denoting down and up outcomes, respectively. Denote 
the left and right strings, respectively, by l and r, and 
their n-length prefixes by /„ and r n . E.g., a typical 
run of the devices might give / = 01101011 ... and 
r = 10110100... ; the length-4 prefixes for these 
strings are / 4 = 0110 and r 4 = 1011. To cap off the 
description of the thought experiment, assume that 
there is in fact an ensemble of such devices: each 

macroscopically identical to the next, each with its 
own supply of electrons, and each performing the op- 
eration described. Associated with this ensemble will 
be an ensemble of ordered pairs (/, r) and consequently 
ensembles of pairs (!„,’'„)• 

With this as a scaffold, the CHV notion can 
be formalized. Simply put, a CHV is said to be re- 
sponsible for the measurement outcomes if for every 
pair (/,r) in the ensemble, there is at least one of a fi- 
nite set of computer programs (more formally recur- 
sive functions) that, for any n, produces the string r n 
as output whenever given /„ as input. Note that each 
string / can have as its origin any process whatsoever: 
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deterministic or indeterministic. This definition only 
requires that a rigid, mechanistic relation between / 
and r be maintained. Furthermore, such a theory is 
inherently nonlocal; the program provides the “medi- 
um” for the instantaneous action of the one string on 
the other. The finiteness of the set of programs is 
meant to allow for the possibility that the SG devices 
in the ensemble might have differing microscopic 
initial conditions. 

The main result may now be stated. Because 
a CHV provides a compression scheme for the mea- 
surement data, it must contradict the statistical pre- 
dictions of quantum mechanics. Suppose the proba- 
bility distributions for the ensembles of strings / and 
pairs (/ n ,r n ) are p n (i n ) and g n (/ n , r n ), respectively. 
The Shannon entropies for these distributions are: 

H n = - T,i n P n (i n )i°g[p n (i n )] 

_ and (2) 

H n = ~ T,l n Er n 9 n (l n ,r n )log[q n (l n ,r n )], 

where log denotes the base-2 logarithm. Consider the 
quantity | H n — H n | . Standard quantum theory re- 
quires that this be proportional to n. For CHV’s, 
however, this quantity is necessarily bound by a con- 
stant independent of n. The remainder of this section 
will be devoted to justifying the quantum mechanical 
result; the corresponding result for a CHV will be 
derived in the next two sections. 

Suppose that standard quantum theory does 
indeed hold in the thought experiment. In that case, 
the required Shannon entropies are straightforward to 
derive. The essential ingredient in this derivation is 
simply noted: quantum theory declares that the only 
condition determining the measurement outcomes is 
the probability distribution derivable from (1). 
Hence, the probability of a 0 or a 1 occurring in the 
fc’th place of a string l n must be independent of k. 
Furthermore, this probability is independent of which 
left-hand SG device in the ensemble produced / . 
Analogous results hold for any string r n and for the 
correlation probabilities between the it’th places of / 
and r n . With these considerations, it is a simple 
exercise in quantum mechanics to show that 

\H n ” H n \ - where 

f(0) =[sin 2 ^log[sin 2 ^] + cos 2 | log[ cos 2 |] ). 

Therefore, for any 9 other than 9 = 0 or 9 = 7 r, 

I “ H n\ « (3) 

IV. Algorithmic Information Theory 

This section introduces enough of the appara- 


tus of algorithmic information theory that the main 
result can be proven. It does not purport to be a gen- 
eral introduction to the subject. The notion of a “re- 
cursive function” is taken as primitive. For the most 
part, this section follows the development of algo- 
rithmic information theory found in Ref. 3. 

Notation and Definitions : 

Let X = {A, 0, 1, 00, 01, ...} be the set of fi- 
nite binary strings in lexicographic order, where A is 
the empty string. Elements of X may be thought of 
dually as strings and natural numbers. Let X n be the 
set of n- length strings. 0(1) denotes a bounded func- 
tion. The variables s, f, v, x, and y denote elements 
of X. The length, n-length prefix, and Jb’th digit of s 
are denoted by | * | , and s(Jb), respectively. A set 
S C X is called an instantaneous code if for any x,y in 
S, neither x nor y is a prefix of the other. Elements of 
S (denoted generally by r) are called programs. A 
computer C is, a recursive function C:SxX-+X. A 
computer U is said to be universal iff for each com- 
puter C there is a constant k c such that: if C(r, v) is 
defined, then there exists a program r' G S such that 
U(r , u) = C(r, v) and | r ; | < | r | + k q . Let a partic- 
ular countably infinite instantaneous code S and uni- 
versal computer U be chosen as standard. Finally, let 
(,):XxX-+X be a recursive bijection with the proper- 
ty that if | « | = | f|. then <«,*)= s(l)t(l)s(2)t(2 ) . . . . 
E.g., if s = Oil and t = 101, then (s, t) = 011011. 

The algorithmic complexities are defined by: 

K c (s/t) = min{|r| : C(r,t) = s} 

*c(*) = K c (s/A) 

K c (s,t) = K c ((s,t)) 

K{s/t) = K v {s/t) 

The canonical program s* for s is defined by s* = 
min{r : £f(r f A) = «}. Clearly \s*\ = K(s). 

Now let p:\— ►[0, 1] and let p n denote the 
restriction of p to X n . p is said to be a probability 
measure for a stochastic process if it satisfies: 

£ Pn(*) = l & Pn-l(») = Pn(»0) + P„(lfl) 

\x\ = n 

for any n and any y £ X n _ j . If p is recursive, then p 
is said to be a computable measure. In this section, 
only computable measures are considered. The Shan- 
non entropy H n for p over n-length strings is: 

^n=~£ Pn( x ) lo 9[P n {z)]- 
| x | = n 

Finally, with the measure p, the average complexities 
( K /y)p an< l (K)p for n-length strings are defined by: 

Wi /)?=£ Pn(*)X(x/y) t {K)” = (K/ A)" 

I X I =n 
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Theorems : 

Theorems (a)-(f), from Ref. 3, are listed so 
that the present treatment will be self-contained. 
Theorem (i), from Ref. 4, provides the link relating 
complexity to entropy. Theorems (g), (h), and (j) are 
simple results due to the author. When crucial, rather 
than relegating bounded terms to a term written as 
0(1), a constant written in the form D , where the 
ellipsis symbolizes a set of subscripts, will be used so 
that all dependencies are clear. (E.g., D p denotes a 
constant that depends only on the measure p .) 


(a) For any computer C, K{s/t) < K c (s/t) + k c . 

(b) K(s)<K(s,t) + 0(1). 

(c) jr(*/()<K(*)+o(i). 

(d) K(s,t)= K(t,s) + 0(1). 

(e) K(s,t) = K(s) + K(t/s*) + 0(i). (To make the 
0(1) term’s dependence on U explicit, this can be 
written as | K(s,t) — K(s) — K(t/ s )| < D jj .) 

(f) If /:N-»N is a recursive function and ^ 

converges, then K(n) < /(n) + 0(l)- 


(g) K(t/s*)<K(t/s) + 0( 1). 

Proof : Consider a computer C such that C(r,v) = 

U(r,U(v, A)). Then C(r,s m ) = U(r,s) . Hence it must 
be the case that K c (t/s‘) = K(t/s) for any t. 
By (a) then, K(t/s * ) < K(t/s) +0(1). □ 

(h) -Dy < K(s)-K(s/\s\ m ) < 2log\s\ +D a . 

Proof : A similar result is derived in Ref. 4. The left- 
hand inequality is a consequence of (c). By (f), there 
is a constant such that K(n) < 2 log(n) + for 
all n. The right-hand inequality then follows from 
successive applications of (b), (d), and (e). D 

(i) There is a constant p such that, for all n, 

0 <(K/nT p ~H n <Vv, p . 

(j) There are constants and D[/ )P such that, for 
all n, -Dy < (A')p - H n < 2 logn + D UfJ ,. 

Proof: This is a simple consequence of (h) and (i). □ 


V. Computable Hidden-Variable Theories 

Armed with the last section’s tools, a precise 
definition of a CHV can now be formed. Let Dt de- 
note the set of all possible pairs (l,r) in the ensemble 
of strings produced by the thought experiment. 

Def: A CHV V is said to be responsible for 

the measurement outcomes if there is a finite subset 
V C S such that for each (/, r)6» there exists a»£V 
for which it is the case that U(v,l n ) = r„ for every n. 


Notice immediately that if V is responsible for the 
outcome strings, then for each (/, r) € K it follows 
that K(r n /l n ) < max{ | v | :v € V) for all n. But then 

by (g), A(r n /C) < D l/ V for a11 n - This ’ coup,ed 
with (e), leads to the forfowing conclusion. 

Thm 1: If V is responsible for the mea- 

surement outcomes, then for each (l,r) € M, it follows 
that | A(/ n ,r„) - K(l n ) I < %,V for al1 n - 

Now consider the probability measures for the 
outcome strings using the notation introduced in Sec- 
tion II. It is assumed that these measures are com- 
putable. (This will be the case if standard quantum 
theory is valid since 6 is required to be computable. 
If it were not the case here, by being noncomputable, 
p and q would trivially differ from the values pre- 
dicted by quantum mechanics and there would be no 
need for further discussion.) For these measures: 

V a (I ,r ) = P„(L) for all n. An important 
fact to note is that q n (s n ,t n ) vanishes iff 

Hence from Theorem 1 it follows that, if V generates 
the measurement outcomes, for all n, the quantity 

I Ei r 

| 4 n’ 7 n n 

will be bounded by a constant y Now because of 
the form of the bijection (l n i r n )i ^be double sum in 
this expression may be construed as a single sum over 
strings of length 2 n. This fact leads to the following. 

Thm 2.: If V is responsible for the outcome 

strings, |(^)* n -Wpl ^ D^y for all n. 

Combining Theorems 2 and (j), the following emerges: 

Thm J: If V is responsible for the outcome 

strings, \H n - H n \ < D u,V,p,q for a ” n ‘ 

This is the sought after identity; for, although 
y depends on the the CHV explicitly (through 

V ancf possibly p and g), it is independent of n. 
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